LARGE DEVIATIONS FOR THE EMPIRICAL DISTRIBUTION IN 
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Abstract. We consider the branching random walk (Z n ) n >o on R where the underlying 
motion is of a simple random walk and branching is at least binary and at most decaying 
exponentially in law. It is well known that Z n (A) — > v{A) almost surely as n — > oo 
for typical A's, where Z n is the empirical particles distribution at generation n and 
v is the standard Gaussian measure on M. We therefore analyze the rate at which 
F(Z n (A) > v{A) + e) and P(Z n (A) < v(A) - e) go to zero for any e > 0. We show that 
the decay is doubly exponential in either n or t/ti, depending on A and e and find the 
leading coefficient in the top exponent. To the best of our knowledge, this is the first 
time such large deviation probabilities are treated in this model. 

1. Introduction and Results 

In this work we analyze the decay of probabilities of certain unlikely deviation events 
involving the Branching Random Walk (henceforth BRW). As far as we know, very little 
has been done in this direction, although, after optimal law of large numbers and central 
limit theorem type results have been fully obtained, both the question and the events 
we consider seem to us natural and fundamental. To fix notation and context, we begin 
by briefly describing the model fll.il) and giving a short account of some of the relevant 
results in its analysis ( 11. 2ft . A precise statement of the contribution in this paper then 
follows ( II. 3ft . and finally the idea in the proof of the main theorem is conveyed ( 11 Ah . 
Complete proofs for all statements are given in Section [2j 

1.1. Setup. The BRW model traces the evolution by means of reproduction and motion 
of a population of particles on the real line, carried out synchronously in discrete steps 
or generations. We denote by Z n (henceforth the particles measure) the population at 
time n — 0,1, ... , which we describe as a point measure on K with a mass 1 per particle. 
The process is formally defined as follows. Initially there is a single particle at the origin 
Zq = do- It evolves in one generation to a random point measure Z\. Although one 
may consider any law for Z\, often and in this paper as well, attention is restricted to 
evolution by means of independent reproduction and motion. That is, Z\ is realized 
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by the particle giving birth to a random number of descendants, dying, and then all 
descendants independently of each other and of their number moving according to some 
common spatial distribution F. 

At any further generation n > 2 we have (conditioned on Z„_i), 

Zn= W 

where Zf(-) has the same distribution as Z\{- — x) and {Zf : x G Z n _{\ are independent. 
Here and later, for a point measure ( with integer masses, we write x G ( iff x is an atom 
of C, that is if ((x) := (({x}) > 0. We use (x : x G () for the multi-set of atoms of (, 
where each atom x is repeated ((x) times. Moreover, if this multi-set is used as an index 
set (as above), different copies of the same atom are considered different indices. 

Despite the old age of this model it is still quite central in pure and applied probability. 
It remains a popular model for describing and analyzing phenomena in various applied 
disciplines, such as biology, population dynamics and computer science. At the same time, 
due to the fundament ality of the stochastic dynamics it captures, it is frequently found 
in various seemingly unrelated mathematical models (e.g. the Gaussian Free Field 
Interacting Particle System [H]). Finally, there are aspects of the model which are still 
not understood or only beginning to be understood now (e.g. its extremal process [2]). 
For the classical theory of BRW, we direct the reader to the survey by Ney [!19j and the 
books by Revesz [21] and Harris [T3] . 

1.2. Known Results. Since the population-size process (|Z„|) n >o = (Z n {R)) n > is a 
standard Galton Watson process, it is well known that once reproduction is super-critical 

(3:=E\Z 1 \>1 (2) 

and assuming 

E|Zi|log|Zi| < oo (3) 
then for the normalized particles measure Z n = (3~ n Z n we have almost surely 

lim \Z n \ = \Z\ , (4) 

n— »oo 

where \Z\ is some non-negative random variable with E|Z| = 1. The optimal version of 
this theorem is due to Kesten and Stigum [17]. If (3 < 1, the population dies out with 
probability 1; hence from now on, we shall assume ([2]). 

When displacement is considered as well, an analogous result to the above, conjectured 
by Harris [13], first proved by Stan [22], and then proved under optimal conditions by 
Kaplan [16J is 

lim Z n (^A) = \Z\u(A) P-a.s. , (5) 

Here A E A where 

Aq := {(-oo,x] : x G R} , (6) 
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v is the standard Gaussian measure on K, and the assumptions are (j2J), (EJ) for branching, 
and zero mean and unit variance for the motion, that is 

,dF(,) = ; /x>dF ( ,)=l. (7) 

Combining (]3J) and (jSJ) and denoting the empirical •particles distribution by Z n = Z n /\Z n \, 
we have 

lim Z n (v^A) = . (8) 

n— >oo 

Once leading order asymptotics (jlj), (JHJ) have been obtained, second-order terms, or 
the question of the rate of the convergence, can be approached. For the population size, 
Heyde [H] has shown that under E|Zi| 2 < oo, for some (explicit) a > 0, as n — > oo 



Z\- 1/2 f3 n/2 (\Z n \ - \Z\) =>N(0,1). (9) 



For the particles measures, more recently Chen [T5] has proved that for all A G *4.0) 

^(Z n (^A) - \Z\u(A)) = v?i(n)|£| + ai M + o(l) , (10) 

as n — > oo, where a\ > 0, <^i(') is a bounded function, and M is some random variable - 
all explicitly defined. In the case he considered, motion is of a simple random walk and 
branching admits the same assumptions as in Heyde's. 

Having settled the main questions in the "typical deviations" regime, it is natural to 
turn to the regime of atypical or large deviations. Results here are not as abundant. For 
\Z n \, Athreya [3] has considered the following probabilities: 

¥(\\Z n+1 \/\Z n \-/3\ > A) and P(| \Z n \ - \Z\ \ > A) , (11) 

for A > and under the assumptions of exponential moments and \Z\\ > 1. If p := 
P(|^i| = 1) > 0, he showed that the probability on the left is 

Ao(A)p n (l + o(l)) (12) 

for some explicitly defined Ao(A) > and otherwise, it is at most 

ai(A)exp(-Ai(A)6 n ), (13) 

where b is the first integer for which P(|Zi| = b) > and Ai(A),«i(A) > 0. For the 
probability on the right, he obtained the bound 

F(\\Z n \ - \Z\\ > A) < C7exp ( -C'A 2/3 (/3 1/3 ) n ) . (14) 



Above C,C > are some universal constants. See also [20] . Different atypicality is 
treated by Jones (15] and Biggins and Bingham (7] who investigate the left and right tail 
of \Z\. 
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For the BRW, much effort has been directed into estimating the number of particles 
which deviate linearly away from the mean displacement in the underlying motion. It is 
a classical result by Biggins that for any A e Ao, 

lim n" 1 dog Z n (n A) = - inf A*(x) P-a.s. , (15) 

if the r.h.s. is positive and otherwise Z n (nA) — » a.s. Here A* is the Legendre-Fenchel 
transform of A(9) = logE J e 6x dZi(x), which is assumed to be finite. This can be also 
used to obtain the speed of the left (or right) most particle as inf {a; : A*(x) < 0}, although 
to obtain sharper results, different methods have been used (c.f. Brahmson [§1 HOj. and 
Addario-Berry and Reed [1]). 

Perhaps closest to the type of large-deviation analysis we do here is the result by Athreya 
and Kang in [3], where instead of a motion in R, particles move according to some positive- 
recurrent Markov chain with invariant measure tt. Along with a local version of ©, they 
find that the probability that at time n the fraction of particles at state s is at least A > 
away from 7r(s) decays exponentially as X(A)p n for some explicit A(A) > and with p as 
in fll2p . which is assumed to be positive Nevertheless, this is still quite far from what we 
do here. First, random walk is typically null recurrent (unless degenerate). Second, there 
is no spatial component (e.g. CLT-type phenomenon) to their problem. Third, we in fact 
assume p\ = and thus obtain very different decay scales. 

1.3. New Results. In this work we analyze large deviation probabilities of the form: 

P(|Z n (VrL4) - v{A)\ > A) . (16) 

for some A > 0. In light of ©, the above clearly decays in n and we aim to understand 
how fast. 



Assumptions. We make the following assumptions. For branching, we shall assume that 
\Zi\ is non-deterministic, that Ee e ' Zl ' < oo for 9 in some neighborhood of and that 
P(|Zi| > 2) = 1. The last condition guarantees that exponential growth of the population 
size is unavoidable. Although the case of P(|Zi| > 2) < 1 is an interesting problem, 
it is of a different nature as it permits using strategies which suppress the branching in 
order to realize large deviation events. This will result in a different scale for the decay in 
(Tl6|) . For the underlying motion, we shall assume simple random walk steps. The precise 
step distribution will not change the result, as long as it has mean zero and bounded or 
sufficiently decaying tails. Again, allowing for steps with fat tails would have given rise 
to strategies which exploit these tails for achieving the unlikely events, resulting again in 
a problem of a different nature and a different scale for the decay of (I16p . 
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We are now ready to state our main result. Let A be the algebra generated by Aq 
(defined in (151)). For A G A non-empty and p G (0, 1) define 

I A {p) = inf-OI : v(A-x) >p, i£i}, (17) 

J A (p) = inf {r : supz/((A - x)/\/l - r) > p , r G [0, 1)} (18) 

and with 6 = min{£; : P(|Zi| = k) > 0} > 2, set 

I A {p) = (log 6) J A (p), (19) 

J A (p) = (log 6) J A (p). (20) 

Then, 

Theorem 1. For AGi non-empty and p G (0, 1) suc/i that p > v{A), 



log [ - logP^CvGA) > p)] ~ ( < °° 

L v v >—rn l j A [p)n otherwise. 



(21) 



as n — >■ cxd. 



Replacing A with A c in Theorem [TJ one has 
Theorem 1'. For all A G ^4 \ R and p G (0, 1) such that p < u(A), 

log [ - log P( 4 (V5U) < P)] - { f ot^se.^ < °° < 22 > 

as n -> oo. 



As follows from Proposition [3] below, for A and p as in the conditions of the theorems 
either Ia(p) G (0, oo) or /^(p) = oo and Ja(p) G (0, log 6). Thus on a double-exponential 
scale, Theorem [1] and 1' capture the right first-order asymptotics for the decay of the 
probability of a large deviation in the empirical distribution for such A's and p's. 

The statement in the theorem still holds if we replace the weak inequality in ( )2T|) or 
(122]) by a strong one. Our proof for the lower bound on P(Z n (i/rL4) > p) essentially works 
for F(Z n (^iA) >p). 

The restriction to intervals of the form (— oo, x], (y, x] and (y, oo) in A is quite arbitrary 
and the theorem still holds if A is the algebra generated by sets of the form (oo,a;) or 
more generally, the set of all finite unions of disjoint intervals which either contain their 
endpoints or do not, or contain only one of them and can be finite or infinite, but as long 
as their interior is non-empty. 

On the other hand, (I2ip cannot be expected to hold for all Borel sets, nor even all 
continuity sets of v. Indeed, the following shows that there are simple enough sets for 
which the decay in (TIT)]) has neither linear nor radical rate on a double exponential scale. 
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Proposition 2. For all a £ (1/2,1) and p £ (0,1), there exists a set A, which is a 
countable union of disjoint finite intervals, such that 

log [ - logP(Z n (v^A) > p)] ~ n a (23) 

Similarly, the restriction in our main theorem to values of p in (0, 1) is essential. In 
Theorem [TJ for instance, in the case p = the probability in the l.h.s. of (I2T]) does not 
decay, and for certain sets in A, the case p = 1 cannot be handled by the current proof 
nor a straightforward modification of it. 



1.4. Idea of Proof. It is usually the case in the realm of large deviations that obtaining 
decay asymptotics for probabilities of unlikely events amounts to finding (and proving 
that it is such) an optimal (that is least "costly" in terms of probability) "strategy" 
for realizing the unlikely event. Consider therefore A £ A and p £ (u(A), 1) as in the 
conditions of Theorem [TJ What is the optimal strategy for having at least p fraction of 
the population in the set \JnA at time n instead of the likely v{A)l 

As it turns out, among all possible strategies one needs to consider only two: a shift 
strategy and a dilation strategy. In the former, all particles move together in either the 
left or right direction for w = \x\y/n generations (up to integer rounding, x £ R). This 
can be done with probability exp(— &l x l^( 1 +°( 1 )) by keeping the number of particles at its 
minimum. Relative to the position of the particles at generation w, the target set has now 
"shifted" by —xyfn. Therefore after dividing by the CLT scaling of y/n, each particle at 
generation w will typically have (asymptotically) a fraction of v{A — x) of its descendants 
in \/nA, and this will also be the fraction for the entire population. Consequently, if there 
exists x for which u(A — x) > p, this strategy will realize the event {Z n (\/nA) > p} at the 
sole cost of "steering" the population for w generations. This cost is exp(— ^A{p)Vn{x+o{x)^ 
once x is chosen closest to 0. 

If there is no x for which u(A — x) > p, a "dilation" strategy is employed, whereby all 
particles move together for w' = r'n + x 1 \fri generations [x! £ R, r' £ (0, 1)) such that at 
generation w' they are all at position x'y/n . If r', x' are chosen such u((A — x') / yl — r') > 
p then as in the shift case, the typical overall fraction in \JnA at a large time n will be 
at least p. The probabilistic cost of this strategy is therefore incurred just in the first 
w' generations, and by keeping reproduction at its minimum, it can be exp(— 6 r ' ?1 ( 1+0 W)). 
Choosing the smallest r' possible, {Z n (\/nA) > p} can be achieved by a strategy which 
has probability exp(- e - jA(p)n(1+o(1) ). 

Of course these strategies only give lower bounds for the probability in question. One 
therefore must also show that other strategies would not cost less. In addition, to make the 
above heuristics precise, our proof requires certain uniform estimates for the probabilities 
of finding typical fractions as well as coarse (a priori) estimates for finding atypical ones. 
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2. Proofs 

In this section we provide proofs for the statements in (II. 3p . We first introduce further 
notation ( 12.1 j) which will be used in the proofs then prove various preliminary statements 
( 12.21) which are required in order to make the ideas from (11.41) precise. We then prove the 
main theorem (I2.3P and finally prove Proposition [5] (12.41) . 

2.1. A bit more notation. The space of all particles measures, that is, finite point 
measures on R with integer masses, will be denoted by Z. For ( G 2, we denote by 
(Z^) n > a BRW process with a similar evolution as (Z n ) n > , only that initially Z = (. 
We will write Z% in place of Z^ x for short. v n is the distribution of the position of a simple 
random walk after n steps. For u G K, as usual, u + = max(0,w) and u~ = —(—u) + . We 
will use C, C, C" to denote positive constants whose value is immaterial and changes 
from one use to the other. Constant values which are used more than once are denoted 
Co,Ci, .., and their values become fixed the first time they appear in the text. 

2.2. Preliminaries. 

Proposition 3. Let A £ A be non-empty and p G (0, 1). 

(1) (p,0^v( P A + 0eC°°(R 2 ). 

(2) If Ia{p) G [0, oo) then there exists x G M. with \x\ = Ia{p) such that 

v(A — x) > p . 

(3) Ja(p) G [0, 1) and there exists i6K such that with r = Ja{p) 

u((A - x)/Vl - r) >p. 

(4) Ifp > u(A) then either I A (p) G (0, oo) or I A (p) = oo, J A (p) G (0, 1) 

Proof. Part [1] follows from the dominated convergence theorem and standard arguments 
once we write 

u(pA + t)= / —e-^pdt (26) 

since the integrand is in C°°(1R 2 ). 

For part [2] and [3J if A — R, then Ia{p) — Ja{p) — 0, and there is nothing to prove. 
Otherwise, define 

cp A {r, x) = v((A - x)/VT^) (27) 
which is in C°°([0, 1) x M) by part [H Therefore {x G R : (Pa(0,x) > p} is a closed set, 
which, if non-empty, must contain a minimizer of | ■ | . This shows part [2j 

For part [31 if A contains a half-infinite interval, then since <^a(0, x) — > 1 > p if x — > +oo 
or x — > — oo, we must have Ia(p) < oo. Therefore Ja(p) = 0, and (125]) is satisfied with 
r = and x from part El Otherwise, A is a finite union of finite intervals, and so there 
must exist R < 1, M < oo such that 



(24) 
(25) 
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• (Pa(t, x) > p for some < r < R and x with \x\ < M. 

• fA{ r , < p/2 for all < r < R and x with \x\ > M. 

Thus, Ja{p) is the infimum of the continuous function r over the non-empty compact set 

{(r, x) : (p A (r, x) > p, < r < R, \x\ < M} , (28) 

which gives part [3j 

Finally, if p > v{A), then > by part [2j At the same time, if Ja{p) = 0, then 

Ia(p) < oo by part [3j This takes care of part HI □ 

Below is a standard result concerning the uniformity of the convergence to the Normal 
distribution under the CLT. 

Proposition 4. Let A C R be a continuity set ofi>(A), i.e. u(dA) = and R > 0. Then, 
lim sup s\xp \v n {y/n(pA + £)) -v(pA + g)\ = 0. (29) 



Proof. By Theorem 2 in [8], it is enough to check that 

lim sup v({d{pA + g)) s ) = 0, (30) 

where for a set D C R, we set D 5 := {x G R : inf^ e o |x — y| < 5} and the supremum is 
over p and £ as in the statement in the proposition. Since v is equivalent to A, Lebesgue 
measure on R, we may show (|30~!) with A in place of v. But, 

\((d( P A + 0Y) = A(p(M)^ + C) < i*A((&4H , (31) 
The last term goes to as 5 — > 0, since X(dA) = 0. □ 

We shall need the following uniform Chernoff- Cramer- type upper bound. 

Lemma 5. Let X be a family of random variables on R with zero mean such that for 
some 9q > 

supEe 9oX <oo and sup E(X~) 2 < oo . (32) 

Then there exists C > such that for any A > small enough, any m > 1 and Xx, . . . , X m 
independent copies of random variables in X 

rn 

P (^E X » > A ) <^° A2m (33) 

i=l 

Proof. Using the exponential Chebyshef's inequality we may bound the l.h.s. in (133]) for 

any < 9 < 9 1 < 9 by 

m 

exp { - m(A9 - rn' 1 ^ L Xi {0j) } , (34) 
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where we use Lx(0) = \ogKe ex for the log moment generating function of X. Since Lx(0) 
is in C°°([0,#o)) due to (13"2j) . we may use Taylor expansion to write (note that the first 
two terms are 0) 

L x (9) = \L" x {d)d 2 , (35) 

for some 9 G (0, 9). Now if we denote by Mx{0) = Ke 9X the moment generating function 
of X then 

£ . = m9>M*®-im9>r < CMx{6o) + m - f . (36) 

This follows since Mx{9) > 1 via Jensen's inequality and since 

M x {9) = EX 2 e° x < EX 2 1 X<0 + L7Ee e ° x l x > , (37) 

for some C > independent of X G X. Therefore f[3"21) implies that there exists K > 
for which 

sup sup L" x {9) < K (38) 
XeX ee(o,6»i) 

and thus 

m 

Ae-m^Y^LxM > A0- |^ 2 . (39) 

i=l 

Using this bound with = A/if in ( |34|) and assuming A is small enough, the result 
follows with C = (2K)- 1 in ([33]). □ 

The last lemma can be used to prove the following. 

Lemma 6. There exists C,C > such that for all A > sufficiently small, A C K, 
( 6 2 and n > 1, 



Kl 



(40) 



zee 

TTte same ZioZds «/w;e replace > with < and +A with — A. 
Proof. Starting with the first inequality and using 

Z<(A)= ^'« Z ;~ A \ (41) 
the l.h.s. of pOjl is bounded above by 

E ^( R ) < 1 - f ) + E %& A ) > w\ E ^ - + 1 ) ( 42 ) 

i£( zee xe£ 

as long as A is small enough. Now Theorem 4 in [3] gives a uniform bound on the 
moment generating function e eZn ^' for all n > 1 and 6 G [0,#o], for some 9q > 0. This 
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uniform bound can be extended to include also the moment generating functions of (the 
stochastically smaller) Z%(A) for all A C R and x G R. in the same range of 9. The 
non-negativity of all these random variables imply that we may extend the bound also to 
all 9 < 0. Thus, it is not difficult to see that the family of random variables 

X = {±(Z n (A) - v n {A)) : n>l,ACR} (43) 

satisfies the conditions in Lemma 0, whence (|42p is bounded above by Ce~ c ' A2 ^ for some 
C, C > as desired. 

Replacing A with A c , we obtain (j4"0|) with <, —A in place of >, +A. □ 

We shall need the following uniform lower bound on the probability of a typical devia- 
tion of Z n from the Gaussian distribution. 

Lemma 7. For all A G A, t > there exists e = e(t, A) > such that 

liminf F(Z n (y/nA) > v(A)+t/y/n) > e. (44) 

Moreover, we may choose the e 's such that for fixed A G A and t > 0, 

inf e(t,A') > 0, (45) 

and the above limit with A in place of A is uniform in A, where A = pA + £ for (p, £) 
m am/ compact subset of (0, oo) x (— oo, +oo). The same result holds with < in place of 
> and —t/y/n in place of +t/y/n. 

Proof Consider A = pA + £ for some (p, £) G (0, oo) x (— oo, +oo). We may write 
y/n(Z n (\/nA') — v(A')) as (recall the definition of \Z\ in (j4])), 

y/E(Z n (y/EA') - g|KjO) KAQ 

|^n| \Z n \ 

Now Theorem 4.2 in [21] states that 

E(|Z n |-|Z|) 2 = 0(/3- n ), (47) 

from which it follows by Borel-Cantelli that \fn(\Z n \ — \Z\) — > 0, a.s. At the same time 
Corollary 2.3 in [T2] (notice that the typo 0(1) instead of o(l) there) implies that for 
some positive C ,C%, 

liminf y/n(Z n (<y/nA') - \Z\u(A')) > -C \Z\ - gJa P-a.s., (48) 

where M = lim^oo M n and M n = J xdZ n . The sets considered in the corollary are 
of the form (— oo,y], but it is clear that by summation one can extend it to all sets in 
A. Furthermore, it is immediate from the statement of the corollary that the constants 
C ,Ci can be chosen independently of A = pA + £ as long as (p, £) are chosen from a 
compact subset of (0, oo) x (— oo, +oo). Less immediate, but still true, is that the proofs 



M\Z n \ - \Z\) (46) 
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of the corollary and Theorem 2.2 on which it is based in fact give that the above limit is 
uniform in all such A'. Combining all the above and writing M for M/\Z\ we have, 

liminf inf y/n(Z n (y/nA') - v{A)) > -C - C X M P-a.s. (49) 

n— >oo A' 

and it remains to show that M is unbounded from below. 

To this end, note that M = lim^oo M n where M n = M n /\Z n \, and that for any integers 
r < n, symmetry of M n _ r around zero entails 

P(M n < -r) > F(Z r = <L r )± > C (50) 

where C = C(r) > does not depend on n. Therefore P(M < — r) > 0, and since r is 
arbitrary, M is indeed unbounded. This shows ( jH)) and f T45]) . 

Finally, applying the above results to A c in place of A, we obtain the same lower bound 
for the probability of a deviation to the opposite side. □ 

2.3. Proof of Theorem Q]. Fix A and p as in the conditions of the theorem. There are 
two cases to consider, according to whether Ia(p) is finite or not. 

2.3.1. The Case Ia{p) < oo. Let x be such that u(A — x) > p and \x\ = Ia(p) > 0, as 
guaranteed by Proposition [31 

Lower bound. Set 

w = l\x\\/n\sgn(x) ; m = n — |w| ; ( = b' w '<5 w (51) 

and write 

nZ n (Vn~A) >p)> ¥(Z H = C) nzi(Vn~A) > p) (52) 

The first factor can be lower bounded by exp{— C6' w '} as the event {Z\ w \ = (} is equivalent 
to having all particles in the first \w\ generations give birth to b children, all of whom 
take either a +1 step or a —1 step, depending on the sign of x. This requires that at most 
C'b\ w \ independent particles make certain branching/ walking choices, all of which have a 
uniformly positive probability. 

The second factor in fl52|) can be bounded below by 

(F(Z m (Vn~A-w) >p)f l . (53) 
The probability in the above expression is further bounded below by 

P (z m (^(^-,) + ^))>^-,)). (54) 

which, for p = and £ = Xv J^ w , is is equal to 

F(Z m (vMp(^ - ^) + 0) > "(P( A ~ x ) + + "( A - x )~ "(P( A + 0) • ( 55 ) 
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Now since p = 1 + 0(l/y/n), £ = 0(l/y/n), part CD of Proposition [3] implies that v(A — 
x) — v(p(A — x) + £) = 0(1/ y/n), whence we may find t > large enough such that (|5"5"j) 
is bounded below by 

P(Z m (v^(p(A - x) + 0) > Kp(A - x) + + t/y/m) (56) 

This is bounded away from uniformly in n via Lemma [7J 

Plugging this back into (153]) . recalling that |£| = b^ w \ the second factor in fl52|) is 
bounded below by exp{— Combining the bounds on both factors in ( 1521) we arrive 
at 

P(Z„(VnA) > p) > exp{-Cb lxlVE } = exp { - e G°g&)^(p)^+C j ^ 

as desired. 

Upper bound. Let e > be arbitrarily small and set 

\ w e\ — L(M — e )V^J i m e = n—\w e \. (58) 
Conditioning on the particles measure ( at generation \w e \, we have 

P(Z n (vfiA) > p) = ^P(^ t (v^A) > p)P(Z w = C) • (59) 

C 

Any such ( must satisfy supp(C) Q [— |w e |, +|w e |]. Therefore there exists 5 > 0, such that 
for all such £ and z G C> 

v(A — z/y/n)< max z/(v4 — z) = p — 5 . (60) 

z: |z|<|ai|— e 

This follows from the choice of x and Proposition |3j 

Using this proposition and also Proposition HJ we further obtain for n large, 

^ Un .(yMA-z)<^u(J^A-^) + i<p- 6 -. (61) 

M zee 11 zee 

Then Lemma [6] implies that ¥(Z^(^/nA) > p) is bounded above by 

P ^(V^A) ^ - *) + 5 J < Ce" ™ . (62) 

As |C| > fe''" 6 ' we have from fl59l) for n large enough, 

P(Z n (v^A) > p) < exp { - e (iog6)(^(p)-6)v^-C} j ( 63 ) 
and this concludes the upper bound as e was arbitrary 
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2.3.2. The Case Ia(p) — oo. The proof in this case is technically similar to the proof in 
the previous case, although the "optimal" strategy for achieving the desired deviation is 
different. We start by setting r = Ja(p) £ (0, 1) and choosing i6l such that 



u((A - x) /y/l-r) > v (64) 
This is guaranteed by Proposition |3J 
Lower bound. Set 

q = 2[rn/2\ ; w = [\x\y/n\sgn(x) ; s = q+|w| ; ( = b s 5 w (65) 

and write 

F(Zn(Vn~A) >p)> F(Z S = ()F(Zi_ s (VKA) > p) . (66) 

The first factor on the r.h.s. is at least exp{— Cb s } since the event there can be achieved by 
having all particles give birth to b children in the first s generations, make only +1 or — 1 
steps in the first \w\ generations (depending on the sign of x), and then alternate between 
+ 1 and —1 steps in the succeeding q generations. This requires that C'b s independent 
particles make certain branching/walking choices, all of which have a uniformly positive 
probability. 

The second factor is bounded below by 

(F(Z n _ s (^A-w)>p)f l (67) 

Setting 



m = n-s , p=\— (1-r) , f = -j=— , (68) 



and using ( EH]) , we may bound below the probability in (IBTj) by 
Z m (y/m ( p— ==L + £ 

Now p = l + 0(l/i/n) and £ = 0(l/^/n) hence by Proposition [3] part (TJ there exists t > 
for which the last probability is bounded below by 

m( p Az±+A ) >u( ( p —zJL+A ) + 1 



This is uniformly (in n, large enough) positive by virtue of Lemma [71 Therefore the 
second factor in ( 166]) is bounded below by e~ c '^ > exp{— Cb s }. 

Plugging the two bounds in (166]) we obtain 

F(Z n (V^A) >p)> exp{-e (logb)s+c } > exp { - e (i°g&)^(f>Wi+°(i))} ? ( 71 ) 

as desired. 
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Upper bound. As in the previous case, let e > be small enough and set 

q e = \_(r — e)n\ ; m t = n — q e . (72) 

This time we condition on the particles measure ( at generation q e : 

nZni^A) > p) = 5>(^(VnA) > p)F(Z qt = C) • (73) 

C 

Now, from the definition of r it follows that there exists 5 > such that for all e' G [e, 2e] 

and z G R, 

"(73=7) (74) 

Therefore, for any measure ( and n large enough by Propositions |3] and H] 

i^E^(^-,)<1^k^-v^) + ^^-5- ^ 

M zee ^6C 
Using Lemma [6] we have that P(Z£j (y/tL4) > p) is bounded above by 

P (r>(Zi e (V^A) > i g ^(v^ - *) + | J < Ce^'l^l . (76) 

But if C is a possible particle measure at generation q e , then |£| > b q " . Hence from (1731) 
we obtain for n large enough, 

F(Z n (V^A) >p)< e~ Cbq * < exp { - e ^ s b){j A (p)-e)n-C^ ? ^ 
and since e is arbitrary the upper bound follows. □ 

2.4. Proof of Proposition [2]. Let a G (1/2, 1) and p G (0, 1) be given and choose a > 
such that v(Aq) = p where A Q = [—a,+a\. Fix some small 5 > and for any integer 
k > 1 set: 

x fc = A; 1+5 , r k = yjl- k- (± ^^ , A k = x k + r k ■ A . (78) 
Finally, for some k > to be chosen later, set 

A = [j A k . (79) 

fc=fco 

We shall now argue that fT23|) is satisfied with the above A, a and p. 

Lower bound. For any n large enough, set k = frT,^ x / 2 )/ (n-* 5 ) ~| ; w = \_x k y/n\ , m = n — w, 
( = b w 5 w and write 

nZn(V^A) >p)> F(Z W = C) P(^(V^A) > p) • (80) 

The first factor on the r.h.s. is at least exp{— Cb w } > exp{— & na ( 1 +°( 1 )} j as the event can 
be achieved by all particles multiplying at rate b and having their descendants take a +1 
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step for w generations. Therefore, as in the proof of the lower bound in the 1(A) < oo case, 
it is enough to show that ¥(Z m (y/nA — w) > p) is bounded away from independently 
of n. This, in turn, follows from Lemma [7] since v((*JnA — w) / y/m\ is bounded below by 

u(^(A-x k ))-0(n-^) > u((l-n-^y 1/2 (A k -x k ))-0(n-^) (81) 

> u((l - n-^y 1/2 r k ■ A Q ) - 0(n- 1 ' 2 ) (82) 

> v(A ) - 0(n-^) (83) 
= p-0(n^ 2 ). (84) 

Upper bound. Let e > be arbitrarily small and set w t = |_(1 — e)n a \ and m e = n — w e . 
By conditioning on the particles measure in generation w e , it is clear that 

¥(Z n (y/nA) > p) < max F(Z^ e (VnA) > p) (85) 

where the maximum is taken over all feasible particles measures ( for generation w e . For 
such (, we may write 

777 y~] v mt (y/nA - z) < max v mt (y/nA - z) (86) 

< maxi/( A /-^4- -*=) + 0(n~ 1/2 ) (87) 

< max y{M{A- y)) + 0(n-W) , (88) 

\y\<(l— t)n a ~'-i z V 

where for the second inequality, we have used 

limsup sup sup m l l 2 \v m (y/m(pA + £)) — v(pA + £)| < oo , (89) 

m->oo [1/2,2] £eR 

which holds for the set A in light of (2.5) of [5]. 

Consider now some y in the range of the maximum in ([88]) and find the index k of the 
closest point to y among (x k ) k > ko . We can then write 

±( A ~V)= \f±(Ak -y)U x f±(A\A k - y) (90) 



and bound the Gaussian measure of each set separately. 

The measure of the first set is upper bounded by (using Proposition [3]) 



v(y/%(Ak-x k )) = K^/ k -A ) (91) 

< u(A )-C(l- ^r k ) (92) 

< P-C((l-r k )-(^-l)) • (93) 
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For the second set in (1901) . note that from the definition of A it follows that 

A\A k -yC(-Ck s ,+Ck s ) c , (94) 
for large enough k. Then, using a standard bound on the tails of u, we obtain 

u(^(A\A k -y))<C'e~ Ck2S . (95) 

Combining the two bounds, we have 

"(y/^M ~V))<P- C" ((1 " r k ) - C'e~ Ck2S - (y^ - 1)) (96) 

Now if ko is chosen large enough, the r.h.s. above is maximized when k is the largest 
possible. At the same time, the choices of k and y imply 

(k - l) 1+s < y < (1 - e)n a - 1/2 (97) 

which gives an upper bound on k. Using this in (1961) we infer that the r.h.s. of ( 188]) is 
bounded above by 

p-C {n-^- a) /2 - (1 - e)n'^- a) /2) (1 - o(l)) < p - C'en- {1 - a) . (98) 

We may now use Lemma [6] and the fact that | £ | > h Wt to conclude that 

F(Zi t (^A) >p)< C exp(-C'e 2 n- 2 ^ a W-^ na ) . (99) 
This finishes the proof as e was arbitrary. □ 
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